Building Integrated Systems for Data Representation and Analysis in Molecular Biology
نویسندگان
چکیده
of eukaryotic genomes centered on the relations between genomic sequences and their chromosomal localization [8]. In their early stages of development, ColiGene and MultiMap integrated a few sequence analysis methods. But, little by little, we found useful to provide access to the methods associated with a peculiar knowledge base to the other. Moreover, it was obvious that the methods integrated in ColiGene and MultiMap lacked of standardization and that their integration in the knowledge bases was a bit artificial (i.e. they were more associated pieces of software than really integrated tools). This is why we have decided to separate methods from the biological objects, while maintaining communication possibilities between these two kind of knowledge. To this purpose, we have developed two complementary packages: Digit and Misa. The first one was primarily designed as a graphical layer for a knowledge base devoted to the methodology in multivariate analysis: Slot [9]. The genericity of the tools available in Digit allows their use with any knowledge base developed with the Shirka system. After Digit, we have developed Misa, a more specific system which is able to virtually integrate any available sequence analysis method, this due to its modular conception. The modules that are part of the Misa system are now our basis in the building of a more advanced system in which methods — defined as “tasks” — are managed under an “intelligent” system guiding the user in the choices and the chainings to do in a way to perform complex analyses. Biochemical techniques have given to biology experimental tools for sequencing genome fragments. These fragments were of a short length — often corresponding to a single gene — this until the development of extremely fast sequencing methods. By now, large sequencing projects have been started, leading to the availability of huge continuous fragments (300 kb for yeast chromosome III). Besides the problem of storing and representing such an amount of data, there is also the fact that the internal organization of these large fragments is often completely unknown. Consecutively, in the near future, the data and knowledge bases dedicated to molecular biology will be most certainly associated to sequence analysis systems in a way to help study the unannotated fragments. We present here an example of such association between two “biological” objectoriented knowledge bases — ColiGene and MultiMap — and two sets of methods that are able to communicate with them. Then, we show how it is possible to formalize the methods under an object-oriented knowledge representation model for tasks.
منابع مشابه
Defining the Building-Integrated Photovoltaic Designs in architecture and urbanism with Emphasis on Usage in Iran
Demand of energy is more sensible now because of population growth and economic growth of countries. In order to increase of energy consumption around the world and environment pollution, attention to the renewable energy such as solar energy is inevitable. Building-integrated photovoltaic (BIPV) electric power systems not only produce electricity, they are also part of the building. For exam...
متن کاملIntegrated Power Management System for Zero-energy building: Techno-Economic Analysis
A complete understanding of the limits, barriers, and equipment available for the use of national energy resources is essential. This will increase the use of existing regional energy potential and pave the way for sustainable development. In this study, the Green Cottage power management system from a renewable energy source was designed and evaluated using a decision-making algorithm. Thirtee...
متن کاملDeveloping a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information
With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...
متن کاملUsing two retrotransposon-based marker systems (SRAP and REMAP) for genetic diversity analysis of Moroccan Argan tree
The Argania is an endemic genetic resource in Morocco holding an important ecological and socio-economical benefit. However, overgrazing and overharvesting lead to a serious downturn in the number of trees. To characterize genetic diversity within and among 24 populations, represented by 240 argan trees, four combinations of SRAP primers and eight combinations of REMAP primers were used. A tota...
متن کاملRepresentation a Model of Integrated Curriculum to Happiness(joysfulness) for Primary School
Purpose: The aim of this study was to provide an integrated curriculum model for the revitalization of primary schools. Methodology: The research is descriptive-survey in terms of nature and method of data collection and applied in terms of purpose. The statistical population of this research consists of professors and experts in the field of educational sciences and experts of the General Dep...
متن کاملAn Integrated Computing Environment for Bio-Molecular Networks
With the development of systems biology, more and more researchers focus on the study of biomolecular networks. In recent years, researchers in different fields have accumulated a large number of biological experimental data and algorithms for analysis and calculation of bio-molecular networks, but these data and methods are relatively independent, difficult to be utilized by biologists. Based ...
متن کامل